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® £ NA molecu,es comprising the genes for preprochymosln and its maturation forms, and microorganisms transformed 
thereby. 

© The present invention relates to recombinant DNA 
and plasmfds comprising specific structural genes of 
mammalian origin coding for the various allelic and matu- 
ration forms of preprochymosin, particularly those of 
bovine origin, and the use of said recombinant plasmids 
to transform microorganisms in which said genes are 
expressed. 
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DNA MOLECULES COMPRISING THE VARIOUS ALLELIC FORMS OF THE STRUCTURAL 
GENES ENCODING THE MAMMALIAN PROTEINS PREPROCHYMOSIN, PROCHYMOSIN^ 
P SEUDOCHYMOSIN AND CHYMOSIN; RECOMBINANT DNA CLONING VEHICLES COM- 
PRISING SAID DNA MOLECULES, MICROORGANISMS TRANSFORMED THEREBY AND 
SYNTHESIS OF SAID PROTEINS IN THESE MICROBIAL HOSTS 

The present invention relates to recombinant DNA and plasmids comprising 
specific structural genes of mammalian origin coding for the various 
allelic and maturation forms of preprochymosin, particularly those of 
bovine origin, and the use of said recombinant plasmids to transform 
microorganisms in which said genes are expressed. 

Chymosin is a protein originating from the stomach of newborn mammals. 
The bovine type (EC 3.4.23.4) is secreted as an inactive precursor, 
prochymosin, which consists of a single polypeptide chain of 365 amino 
acid residues (B. Foltmann et al.\ Proc. Natl. Acad. Sci. USA; 74, 
2321-2324, 1977; B. Foltmann ot at. ; J. Biol. Chemistry; 254, 8447-8456, 
1979). Applicants have found that the precursor of bovine prochymosin, 
preprochymosin, consists of a single peptide chain of 381 amino acid 
residues and differs from prochymosin in an ami no- terminal of 16 amino 
acids. This extension very much resembles a signal sequence which is 
involved in the process of cotranslational excretion (G.Blobel & 
U.Oobberstein, J. Cell. Biol. 67, 835-851, 1975). 

Prochymosin is irreversibly converted into active enzyme (chymosin) by 
limited proteolysis, during which a total of 42 amino acid residues are 
released from the ami no-terminal part of the peptide chain. The activa- 
tion is effected through a pH-dependent two-step autocatalytic conversion. 
The intermediate, pseudochymosin, is formed by proteolytic cleavagr; of 
bond 27-28 at pH 2-3. The final product, chymosin, is formed by activa- 
tion of pseudochymosin at pH 4-5. (V. Barkholt Pettersen «t ul. , Eur. J. 
Biochem. , 94, 573-580, 1979). 

The enzymatic activity of chymosin consists of the specific proteolysis 
of K-casein. This property makes chymosin widely used as mi lk-clotting 
eruyme in cheese manufacture. 

Chymosin is the essential milk-clotting component of rennet, the crude 
extract of the abomasum of bovine calves. Rennet is used in the produc- 
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tion -of several types of cheeses in almost'all parts of the world. 
Because of " the ever increasing shortage of calf rennet created by the 
increasing -demand for cheese , many laboratories have been searching 
5 for substistutes of microbial origin.. Many microbial proteases 

have been screened, only a few could be used in cheese making, and 
even these substitutes exhibit a different specificity and therefore 
may cause an unacceptable texture and/or bitter taste of the cheese/ 
So the production of chymosin by recombinant DNA-containing micro- 

10 organisms is expected to become of great economical importance. 

,f r - - 

Developments in recombinant DNA technology have made it possible to 
isolate or synthesize specific genes or portions thereof from higher 
organisms, such as man and other mammals, and to transfer these genes 

15 or fragments to microorganisms such as bacteria or yeasts. The trans- 
ferred gene is replicated and propagated as the transformed micro- 
organism replicates. As a result, the transformed microorganism may. 
become endowed with the capacity to make whatever protein the gene 
or gene fragment encodes, whether it is an enzyme, a hormone, an anti- 

20 gen, an antibody, or a portion thereof. The microorganism passes on 
this capabil ity to its progeny, so that in effect, the transfer has 
resulted in a new microbial strain, having the described capability. 
See, for example, Ullrich, A. at al. , Science 196 , 1313 (1977), and 
Seeburg, P.H. et at. , Nature 270 , 486 (1977). A basic fact underlying 

25 the application of this technology for practical purposes is that DNA 
of all living organisms, from microbes to man, is chemically similar, 
being composed of the same four nucleotides. The significant differ- 
ences lie in the sequences of these nucleotides in the polymeric DNA 
molecule. The nucleotide sequences are mainly used to specify the 

30 aminto acid sequences of proteins that comprise the organism. Although 
most of the proteins of different organisms differ from each other, 
the coding relationship between rsucleotide sequence and amino acid 
sequence is fundamentally/ the se-^e for all organisms, 

35 For example, the same nucleotide sequence \:hzC.i cedes for the amino 

acid sequence of HGH in human pituitary cells, will, when transferred 
to a microorganism, be recognized as coding for the same amino acid 
sequence. 

40 For economic reasons 'it is important that proteins encoded by the 
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recombinant DNA gene are produced under optimal conditions in approved 
edible micro-organisms as host cells. The main routes to achieve this are 

(1) integration of the structural gene down-stream of an effective 
regulon, in such a way that under selected growth conditions, the amount 
of protein produced per cell (by an optimal number of cells) is as high 
as possible. 

For that purpose regulons like the double lac UV5, the trp regulon, the 
double trp regulon of E. coli and the regulon of the gene VIII product 
of the bacteriophages M13, fd and fl are, amongst others, adequate in 
their natural state or in their modified form(s). 

Another factor influencing the yield of required protein per cell is 
the increase (amplification) of the copy number of the pi asm ids con- 
taining the above-described regulons and the structural genes. Ampli- 
fication can be effected by the use of a thermosensiti ve replication 
mutant derived from the cloacin DF 13 plasmid (pVU 208). 

(2) excretion of said protein by microbial host cells into their 
periplasmic space and/or into the culturing medium, thus preventing 

said protein from intracellular degradation or preventing the disturbance 
of normal cellular processes due to too high an intracellular level of 
said protein. It is now generally accepted that in many prokaryotic and 
eukaryotic cells a special NH2- terminal amino acid sequence of the 
unprocessed form of the proteins is involved in the protein excretion 
process . 

G.Blobel & B.Dobberstein (1975), J. Cell Biol. 67, 835-851. 
In the present invention use is made of recombinant DNA and other 
molecular biological techniques to construct recombinant DNA molecules 
that fulfil the above-described requirements. The present invention is 
also related to the change of the genetic information of structural 
genes using site-directed mutagenesis. 

For a better understanding of the invention the most important terms 
used in the description will be defined : 

An operon is a gene comprising a particular DNA sequence (structural) 
gene(s) (For polypeptide(s) expression) and a control region or regulon 
(regulating said expression) anci mostly consisting of a promotor 
sequence, an operator sequence, a ribosome binding- or interaction DNA 
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Structural genes are DNA sequences which encode through a template 
(mRNA) a sequence of amino acids characteristic for a specific poly- 
peptide. 

A promoter is a DNA sequence within the regulon to which RNA polymerase 
binds for the initiation of the transcription. 

An operator is a DNA sequence within the regulon to which a repressor 
protein may bind, thus preventing RNA polymerase from binding to the 
adjacent promoter. 

15 An inducer is a substance which deactivates a repressor protein, 
freeing the operator and permitting RNA polymerase to bind to the 
promoter and start transcription. 

Cloning vehicle . A non-chromosomal double-stranded DNA, plasmid or 
20 phage, comprising a DNA sequence (intact replicon) that allows self- 
replication after transformation into suitable host cells. 



Phage or bacteriophage . Bacterial virus which can replicate in a 
suitable bacterial host cell. 

25 

Reading frame . The grouping of triplets of nucleotides (codons) into 
such a frame that at mRNA level a proper translation of the codons 
into the polypeptide takes place. 

30 Transcription . The Process of producing RNA from a structural gene. 

Translation . The process of producing a polypeptide from mRNA. 

Expression . The process undergone by a structural gene to produce a 
35 polypeptide. It is a combination of many processes, including at 
least transcription and translation. 

Thermosensitive replication muta nt. A plasmid containing a mutation 
in its replication origin which causes its copy number to be 
40 temperature dependent. 
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Allel ic f orm- One of two or more naturally occurring alternative forms 
of a gene product. 

Maturation form . One of two or more naturally occurring forms of a gene 
product procured by specific processing, e.g. specific proteolysis. 

Plus strand . DNA strand whose nucleotide sequence is identical with the 
nRNA sequence, with the proviso that uracil is replaced by thymidine. 

By maturation forms of preprochymosin are meant prochymosin, pseudo- 
chymosin and chymosin. 

Prochymosin arises through the action of signal peptidase on prepro- 
chymosin, which causes the loss of the amino terminal - excretion 
related - signal sequence. 

The chemical structure of bovine preprochymosin is given in Fig. 1. 
Bovine prochymosin corresponds with residues 1-365 (Fig. 1). 
Pseudochymosin arises through the autocatalytic proteolysis of pro- 
chymosin at pH 2, causing the loss of an amino terminal portion of 
prochymosin. The chemical structure of bovine pseudochymosin corresponds 
with residues 28-365 of preprochymosin given in Fig. 1. 
Chymosin arises through the autocatalytic proteolysis of pseudochymosin 
at pH 4-5, causing the loss of an amino terminal portion of pseudo- 
chymosin. The chemical structure of bovine chymosin corresponds with 
residues 43-365 of preprochymosin given in Fig. 1. 

According to the invention there is provided structural gene(s) coding for 
the various allelic and maturation forms of mammalian preprochymosin » 
particularly bovine preprochymosin according to Fig. 1 and 2; and further 
a recombinant plasmid comprising : 

(i) structural gene(s) coding for the various allelic and maturation 
forms of mammalian preprochymosin , particularly bovine preprochymosin 
according to Fig. 1 and 2; 

(ii) specific DNA sequences which regulate the expression of said 
structural genes in a microbial host. 

These specific DNA sequences consist of either an inducible or a 
constitutive regulon. A preferred inducible regulon consists of a 
double lac UV5 system as described by D.V.Goeddel et al. , Nature 281, 
544-548 (1978). (See Fig. 8). 
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anotner oref erred inducible regulon is a constituent of the tryptophan 
system described by F. Lee .,.[.., J. MoT. Biol. 221, 193-217 (1978) 
and K. Bertrand *i *l. , Science 139, 22-26 (1975). Applicants have 
modified this tryptophan system to obtain a more adequate system 
according to Fig. 9 . In this modified system the information coding 
for the trp attenuator protein is eliminated while maintaining its 
ribosome-binding site. Expression is highly increased when two trp 
regulons in a head to tail fashion are being used. Synthesis of this 
system is illustrated in Fig. 11. 

The recombinant plasmid according to the invention may comprise DNA 
sequences which regulate the expression of the structural genes, 
preferably consisting of a modified promoter/ribosome-binding site of 
gene VIII of bacteriophage M13, fd or fl (P.M.G.F. van Wezenbeek et 
at., Gene II, 129-148 (1980)). 

Efficiency of required protein per cell was highly increased when in 
conjunction with the regulon systems described above, the copy number 
of the recombinant cloning vehicle was increased. This was effected 
by the use of a thermosensitive replication mutant derived from the 
cloacin DF13 plasmid pVU 208 (A. Stuitje, thesis, V.U. Amsterdam, 1981). 
Increase in temperature results in a ten- to hundred-fold increase in 
copy number. The construction of such plasmids is illustrated in Fig. 
12 and 13. 

In the recombinant plasmid according to the invention the regulon may 
be either directly linked to the structural gene or indirectly through 
a novel start codon and EcoRI-site .containing DNA linker comprising 
the nucleotide sequence 5'p CAT(N) n GAATTC(N' ) n ATG (3') wherein 
n = 0, 1, 2 or 3 and OH 
N and N' are any of the nucleotides A, T, G or C, with the proviso 
that in the double-stranded structure N and N' are such that a 
rotational symmetrical structure is present. 

By a rotational symmetrical structure is meant that where N is e.g. 
represented by A, N 1 should be represented by the complementary base T. 
In some instances it turned out that the yield of expression improved 
when the sequence AATT between the regulon and the structural gene has 
been eliminated. 
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Allelic forms of bovine preprochymosin have been constructed departing 
from the nucleotide sequence given in Fig, 1. Examples of these con- 
structions are outlined in Fig. 18 and construction step lOe described 
further in the Specification, whereby use is made of site-directed muta- 
genesis. The latter procedure could also be applied usefully so as to 
produce preprochymosin with a specifically altered signal sequenece 
v/hich allows efficient excretion in microbial hosts and prochymosin with 
an improved acid-induced autocatalytic proteolysis characteristic. 

The microbial cloning vehicles containing the structural genes encoding 
the various allelic and maturation forms of preprochymosin according 
to the invention are produced by a number of steps, the most essential 
of which are: 

1. Isolation and enrichment of the messenger RNA(mRNA) of prepro- 
chymosin. 

2. Conversion of this mRNA into double stranded DNA (dsDNA) 

3. Construction of dsDNA having a poly-dC-tail 

4. Incorporation of the dsDNA-poly-dC molecules in Pstl-cleaved and 
poly-dG-tailed pBR 322 DNA. 

5. Transformation of competent E. eoli cells and selection of tetra- 
cycline resistant colonies. 

6. Determination of the nature of the inserts by RNA/DNA hybridization 
and in vitro translation; and by DNA/DNA hybridization using a 
specific J<L P-labelled cDNA probe. 

7. Double checking the nature of the cloned Pstl-inserts by DNA- and 
RNA-sequence analysis. 

8a. Producing DNA encoding the amino terminal part of pseudochymosin 
plus a trans! ational initiation ATG-codon (added) at the amino 
te minus. 

8b. Producing DNA encoding the amino terminal part of chymosin plus 
a transl ational initiation ATG-codon at the amino terminus. 

8c. Producing DNA encoding the amino terminal part of prochymosin 
plus a translational initiation ATG-codon at the amino terminus. 

8d. Producing DNA encoding the amino terminal part of preprochymosin 
plus a transitional initiation ATG-codon at the amino terminus. 

9. Construction of plasmids comprising a constitutive or inducible 
regulon, with or without a thermosensi tive replication mutation. 

10. Construction of plasmids consisting of plasmids described under 
9 and the li gated preprochymosin gene or its various maturation 
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forms, and transformation of said plasmids into microbial host 
cells. 

11. Culturing of microbial cells, e.g. A', coii cells, containing 
recombinant plasmids described under 10 and detection and iso- 
lation of preprochymosin, prochymosin, pseudochymosin or chymosin 
therefrom. Culturing conditions were optimized as to the yield 
of preprochymosin or its maturation forms per cell. Conversion 
into enzymatical ly active chymosin of the various precursors was 
also optimized. 

The following description will illustrate the above-mentioned steps 
i n detai 1 . 

1- Isolation and purification of bovine (preprochymosin) mRNA 

The fourth stomach of a preruminant calf (abomasum, Frisian cow) 
was ground under liquid nitrogen, extracted with phenol and a 
selective precipitation of the RNAs with Li CI was performed 
following the procedure described by K.S. Kirby, Biochem. J. 96_, 
266-269 (1965), U. Wiegers & H. Hilz (FEBS. Letters, 23, 77-82 
(1972), PolyA-containing mRNA was recovered by several passages 
over oligo-dT-cellulose columns as described by H. Aviv & P. Leder 
(Proc. Natl. Acad. Sci . USA, 69, 1408-1412 (1972)). 



2. Conversion of (prepro) chymosin mRNA into double-stranded DNA 
Purified (prepro) chymosin mRNA was copied with AMV reverse tran- 
scriptase to yield a single-stranded DNA molecule, according to 
the procedure described by G.N. Buell et al. 9 J. Biol. Chem. 253 , 
2471-2482 (1978). This cDNA was subsequently converted into a 
double-stranded molecule using DNA-polymerase, according to the 
procedure described by A.R. Davis et at. > Gene 10, 205-218 (1980). 
Thereafter the loop structure of the double-stranded DNA copy v/a5 
removed by Sl-nuclease digestion. 

3. Construction of double-stranded DNA with poly-dC-t ail s 

DNA molecules of the desired length were obtained by polyacryin-nde 
gel -electrophoresis, extracted from the gel and tailed with poly-dC 
by terminal transferase according to the procedure described by 
R. Roychoudhury « al., Nucleic Acids Research 3> 863-877 (1976). 
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Integration of the dsDNA-poly- d C molecules in p lasmid pBR 322 
Plasmid pBR 322 was treated with restriction endonuclease PstI, 
that cleaves the plasmid at a recognition site that lies in the 
gene encoding the ampicillin resistance, whereafter the linearized 
DNA of pBR 322 was supplied at the Pstl-site with poly-dG-tails by 
terminal transferase. The poly-dC-tailed DNA molecules were annealed 
to the poly-dG-tailed plasmid pBR 322. 

Transformation and clone selection 

The plasmids thus obtained were transferred into CaC^-treated 
E. eoli cells. After transformation, cells containing hybrid plasmid 
DNA molecules were selected on their resistance to tetracycline. 
(M. Mandel & A. Higa, J. MoT. Biol., 53, 159-162 (1970). 

Determination of the nature of the inserts (I) DNA/ DNA colony hy- 
bridization and RNA/DNA hybridization/in vitro translation 
Positive colonies were further selected by colony hybridization 
(R.E. Thayer, Anal. Biochem., 98, 60-63 (1979)) with a radioacti vely 
labelled, chymosin-specific cDNA. The latter product was obtained 
by priming reverse transcription (see 2) of the mRNA preparation 
(see 1) with the oligonucleotide (5") dTTCATCATGTT QH (3'). 
This oligonucleotide was designed by the Applicants, because it 
represents one of the two theoretically possible nucleotide stretches 
coding for the unique amino acid sequence -asN-met-met-asN-, which 
occurs in the (prepro)chymosin molecule at position 183-186 (Fig.l). 
Upon priming cDNA synthesis with this undecanucleotide a distinct 
cDNA product with a chain length of circa 650 nucleotides was ob- 
tained. This product was isolated and used as a probe in the colony 
hybridization experiments. Several positive colonies could be ident- 
ified and, as a double check on the identity of the cloned DNA, the 
plasmid DNA from some of these colonies was isolated and used in the 
hybridizati on/ in vitro translation procedure described by J.G. 
Williams et al. (Cell, 17, 903-913 (1979)). 

Determination of the nature of the inserts (II) by DNA/RNA seque nce 
analysis (Fig. 1) 

The nucleotide sequence analysis of the (prepro)chymosin inserts 
was performed by the chemical degradation procedure as outlined by 
A.M. Maxam & W. Gilbert in Methods in Enzymology, -L. Grossman & 
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K. Moldave editors, New York, Acad. Press, 1980, Vol. 65 (1), 
pages 499-560, and the dideoxy/nick translation procedure as out- 
lined by J. Maat & A.J.H. Smith, Nucleic Acid Research, _5, 4537- 
4545 (1978). Further information on the nucleotide sequence of the 
(prepro)chymosin . mRNA was derived indirectly by primed synthesis 
by AMV-reverse transcriptase on the (prepro)chymosin mRNA template 
in the presence of chain-terminating inhibitors, as outlined by 
D. Zimmern & P. Kaesberg, Proc. Natl. Acad. Sci . U.S.A. 75^, 4257- 
4261 (1978). This screening yielded inlev alia plasmid pUR 1001 
containing an almost complete copy of preprochymosin mRNA. 

Production of DNA encoding the amino terminal part of pseudo- 
chymosin ATG (Fig. 3, 4) 

Numbers refer to the preprochymosin mRNA sequence in Fig. 1, unless 
indicated otherwise. 

Plasmid pBR 322 was cleaved with the restriction enzyme Haelll and 
the resulting fragments were subsequently blunt-end li gated with 
synthetic HindIII-1 inkers (5 1 ) dCCAAGCTTGG (3'). The mixture was 
subsequently incubated with Hindlll and phosphatase. The reactions 
were terminated by protein extraction with phenol /chloroform 
(50/50 v/v) and the DNA was then cleaved with - Pstl. The resulting 
mixture was subjected to polyacryl amide gel electrophoresis and a 
148 bp fragment (fragment A, Fig. 3) extending from position 3608- 
3756 in the pBR 322 DNA sequence (J.G. Sutcliffe, Cold Spring 
Harbor Symposia on Quantitative Biology, 43, 77-90 (1978)) was 
isolated from the gel by electroelution. 

Plasmid pUR 1001 was cleaved with EcoRI and treated with calf 
phosphatase. The mixture wns extracted with phenol/chloroform and 
subsequently Pstl was added. Resulting fragments were separated 
by agarose electrophoresis. Fragment B (see Fig. 3) extending from 
the EcoRI-site at position 549 to the Pstl-site in the noncoding 
sequence of the preprochymosin gene at the carboxy terminal end 
of the pUR 1001 clone, was isolated. 

Fragments A and B were li gated with the large EcoRI-Hindlll frag- 
ment of pUR 201, 301, 401, 303, 210, 310, 410, 311 (combined 
called fragment C) yielding pUR 1520, 1530, 1540, 1730, 1820, 
1830, 1840, 1930, respectively. 
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Subsequently plasmid pUR 1001 DNA was cleaved with PstI and result- 
ing DNA fragments were ligated to the synthetic pentanucleotide 
(5') duoCTGCA (3'). Following ligation, the mixture was incubated 
with E. coli DNA polymerase, large fragment, in the presence of 
dGTP in order to make blunt ends. The DNA was phosphorylated using 
T4 kinase and ATP, and subsequently supplied with synthetic EcoRI- 
linkers of the structure (5* )dCAT(N) n GAATTC(N' ) n ATG (3'), wherein 
n = 0, 1, 2 and 3 and N and N' are any of the deoxynucleotides A, C, 
G or T, with the proviso that in the double-stranded structure N 
and N' are such that a rotational symmetric structure is present. 
The DNA was then treated with EcoRI and resulted in fragment (I) 
of circa 400 bp in length, which was isolated (Fig. 4). 

8b - Production of DNA encoding the amino terminal part of chymosin 
(Fig. 5) 

Plasmid pUR 1001 was cleaved with PstI and the 1300 bp PstI insert 
was isolated. This DNA fragment was heat-denaturated to produce 
single strands. The synthetic primer (5') dGGGGAGGTGG (3') was 
used to produce complementary DNA synthesis starting from position 
198 in the direction of the carboxy terminus by the action of 
EcoRI DNA polymerase, large fragment. Subsequently the DNA was 
treated with nuclease SI to procure blunt-ended dsDNA. To this 
dsDNA the synthetic EcoRI-linker (5') dCAT (N ) n GAATTC ( N ' ) n ATG (3') 
was ligated. After digestion with EcoRI and treatment with phos- 
phatase, the DNA was cleaved once more by Bglll. The resulting 
fragment II (Fig. 5) was isolated. 

8c - Production of DNA encoding the amino terminal part of prochymosin 
(Fig. 6) 

Plasmid pUR 1001 was treated with HphI, followed by nuclease SI. 
A 202 base pair long fragment III was isolated (Fig. 6). 
Fragment III was then ligated to the synthetic EcoRI-linker 
(5') dCAT(N) n GAATTC(N') n ATG, cleaved with EcoRI and dephospho- 
rylated with phosphatase. The resulting DNA was digested once more 
by Bglll and the resulting fragment IV was isolated. 

8d. Production of DNA encoding the amino terminal part of preprochymosin 
(Fig. 7) 

Plasmid pUR 1001 was cleaved with PstI and EcoRI. A fragment of 
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396 bp was isolated. This fragment was then treated with exonuclease 
III to procure single-stranded non-complementary DNA (A.J.H. Smith 
(1979), Nucleic Acids Res. 6, 831-841). 

This DNA was hybridized to preprochymosin mRNA under conditions 
described by G. Akusjarvi and U. Petterson (1978), Proc. Natl. Acad. 
Sci. U.S.A. , 75, 5822-5826. The cDNA synthesis was performed as 
described under 2. Following heat denaturation, dsDNA was made using 
DNA polymerase, large fragment, with (5 1 ) dAGGTGTCTCG 0H (3 l ) acting 
as a primer. The dsDNA was treated with nuclease SI and ligated to 
the synthetic EcoRI-linker dCAT(N) n GAATTC(N l ) n ATG. After EcoRI 
cleavage and dephosphorylation with (calf intestinal) phosphatase, 
the DNA was split once more with Bglll. The resulting circa 230 bp 
long fragment (V) was isolated. 

9. Construction of plasmids comprising a constitutive or inducible 
regulon, with or without a thermosensitive replication mutation 

9a. Construction of a plasmid pUR 201 (Fig. 8) 

A fragment containing 285 base pairs comprising double lac regulon 
(lac UV5) was obtained by restriction endonuclease EcoRI cleavage 
of pKB 268, described by K. Backman & M. Ptashne, Cell 13, 65-71 
(1978). This fragment was ligated in the EcoRI -site of pBR 322 DNA. 
Plasmid DNA with the lac regulon in the right orientation, pUR 200, 
(Fig. 8) was partly cleaved by EcoRI in the presence of /-:. t*oH 
RNA polymerase. The EcoRI cleavage site most distant from the 
restriction endonuclease Hindlll -cleavage site was preferentially 
attacked. The linearized DNA was treated with SI nuclease, purified 
by agarose gel electrophoresis, circularized by ligation with T4 
DNA-ligase and subsequently transformed into coVl* From the 
tetracycline-resistant transformants pUR 201 with the correct 
structure (Fig. 8) was obtained. 

9b. Construction of plasmid pUR 301 (Fig. 9) 

A DNA fragment of about 510 base pairs containing the trp regulon 
was obtained by restriction endonuclease Hinfl cleavage of ptrp ED5, 
as described by R.A. Hallewell & S. Emtage, Gene 9, 27-47 (1980). 
This fragment was cleaved with restriction endonuclease TaqI in the 
presence of E, coli RNA polymerase. The Taql-site in the trp regulon 
(described by K. Bertrand et ai. 9 Science 189, 22-26 (1975) and F. Lee 
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et al., J. MoT. Biol. 121, 193-217 (1978)) was selectively protected, 
thus yielding a fragment containing 234 base pairs comprising the 
trp regulon (Fig. 9). This fragment was then treated with SI nuclease, 
blunt-end ligated with the EcoRI-linker (5*) dGGAATTCC QH (3'), cut 
with EcoRI and subsequently cloned in the EcoRI-site of pBR 322. 

Plasmid pUR 300 with the trp regulon in the correct orientation 
(Fig. 9) was isolated. The EcoRI-cleavage site most distant from 
the Hind Ill-site was removed by partial cleavage of pUR 300 DNA by 
EcoRI in the presence of ethidium bromide and SI nuclease treatment. 
Linear DNA molecules were recircularized by T4 DNA ligase. From the 
tetracycline-resistant transformants pUR 301 with the structure as 
outlined in Fig. 9 was obtained. 

9c. Construction of plasmid pUR 401 (Fig. 10) 

A 269 base pairs fragment comprising the gene VHI-promotor was 
obtained by digestion of RF M13 DNA (DNA sequence 1128-1379 see 
P.M.G.F. van Wezenbeek et al. , Gene 1_1, 129-148 (1980), with the 
restriction nucleases TaqI and Haelll; the Taql-site was made blunt- 
ended by a repair reaction with E. ooU DNA polymerase; the fragment 
was subsequently partly digested with restriction enzyme Mnll. The 
partial products were treated with T4 DNA polymerase and SI nuclease 
and subsequently blunt-end ligated with the EcoRI-1 inker 
(5') dGGAATTCC QH (3 1 ), then treated with EcoRI and ligated in the 
EcoRI-site of the pBR 322. By restriction enzyme analysis and DNA 
sequencing, a plasmid was isolated in which the EcoRI-cleavage site 
was located just beyond the ribosome-binding site of the M13 gene 
VIII DNA sequence. Applicants have found that the plasmids having 
the M13 regulon from nucleotide 1128 to nucleotides 1291 to 1297 
were appropriate regulons for expression. The EcoRI-cleavage site 
most distant from the Hindlll-site was removed essentially as 
described for pUR 301. The complete construction of pUR 401 is 
outlined in Fig. 10. 

9d. Construction of plasmid pUR 303 (Fig. 11) 

Plasmid pUR 300 (9b, Fig. 9) was digested with EcoRI and the 234 bp 
fragment comprising the trp regulon was isolated. This fragment was 
ligated to pUR 301 DNA, which previously had been cleaved with 
EcoRI and dephosphorylated with phosphatase. The ligation mixture 
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was used to transform competent h:, .:ofi cells and from the anipici 11 in- 
resistant transformants pUR 302 v/as obtained. This plasmid comprises 
two trp regulons with identical transcription polarity (Fiy. II). 
pUR 302 was partially cleaved with EcoRI, in the presence of ethidium 
bromide, treated with nuclease SI to generate blunt-ends and the 
cleaved plasmid DNA's were religated. The ligation mix was used to 
transform competent E. coll cells and from the ampicill in-resistant 
transformants pUR 303 was isolated, wherein the EcoRI-site in between 
the two trp regulons in pUR 302 had been removed. 

9e. Construction of pUR 10 (Fig. 12) 

Plasmid pBR 322 was cleaved with PstI and PvuII, and subsequently 
dephosphoryl ated with phosphatase. The 2817 bp long fragment (D, 
Fig. 12) was isolated. 

Plasmid pBR 322 was also cleaved with MboII, treated with nuclease 
SI and phosphatase and then cleaved once more with PstI. The 400 bp 
long fragment (E, Fig. 12) extending from position 3201-3608 
(J.G. Sutcliffe, Cold Spring Harbor Symposia on Quantitative Biology, 
43, 77-90 (1978) was isolated. Plasmid pVU 208 (A.R. Stuitje, thesis, 
V.U. Amsterdam (1981)) was cleaved with BamHI and treated with nu- 
clease SI. The 760 bp fragment (F, Fig. 12) containing the replication 
origin of clo OF 13 with the cop ts mutation, was isolated. 

To construct pUR 10, fragments D and E v/ere li gated first, followed 
by ligation with fragment F. The ligation-mix was used to transform 
competent ff. ooVi cells. From ampicill in- and tetracycline-resistant 
transformants pUR 10-containing cells were isolated. The replication- 
origin-containing fragment is oriented such that the unidirectional 
replication is in a counter-clockwise direction. 

9f. Construction of pUR 210, pUR 310, pUR 311, pUR 410 (Fig. 13) 

These plasmids are derived from pUR 201, pUR 301, pUR 303 and pUR 401, 
respectively, and contain the cop ts replication origin of pUR 10. 
pUR 10 was digested with PstI and BamHI and the 2841 bp long fragment 
(G, Fig. 13) was isolated by agarose gel electrophoresis and electro- 
elution. 

Each of the plasmids pUR 201, pUR 301, pUR 303 and pUR 401 was 
digested with PstI and BamHI and the "regulon^containing fragments 
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(collectively called H, Fig. 13) were isolated. Fragment G and each 
of the fragments H in turn were li gated using T4 DNA ligase and the 
ligation mixes were used to transform competent ti. aoli cells. From 
5 ampicillin- and tetracycl ine-resistant colonies pUR210-, pUR310-, 

pU R 311- and pU R 410-containing cells were isolated. 

10. Construction of plasmids comprising a constitutive or inducible 
regulon and the ligated preprochymosin gene or its various allelic 
10 and maturation forms, the latter being under transcriptional 

control of said regulons, and transformation of said plasmids into 

ff. coVL 

10a. Construction of expression plasmids giving rise to the synthesis 

15 of bovine pseudochymosin (Fig. 14) 

Plasmids pUR 1520, 1530, 1540, 1730, 1820, 1830, 1840 and 1930 
were cleaved with EcoRI and dephosphorylated. Each preparation in 
turn was ligated with fragment I (8a, Fig. 4). This ligation mix 
was used to transform competent E.aoli RRI and from the ampicillin 

20 resistant transformants, cells containing pUR 1521, 1531, 1541, 1731, 

1821, 1831, 1841 and 1931 were selected which contained fragment I 
inserted such that the genetic information coding for pseudochymosin 
was present as a continuous uninterrupted entity. 

25 10b . Construction of expression plasmids giving rise to the synthesis of 
chymosin (Fig. 15) 

Plasmid pUR 1521 was cleaved with Hindi II, dephosphorylated and then 
cleaved once more with Bglll. The resulting ^ 1300 bp long fragment 
VI (Fig. 15) was purified. The vector fragments C (8a, Fig. 3), in 
turn, were ligated with fragments II (8b, Fig. 5) and fragment VI. 
The ligation mix was used to transform competent E. aolt cells and 
from ampicill in-resistant transformants cells containing pUR 1522, 
1532, 1542, 1732, 1822, 1832, 1842 and 1932 were selected. 

35 10c . Construction of expression plasmids giving rise to the synthesis 
of prochymosin (Fig. 16) 

Vector fragments C (8a, Fig. 3), fragment IV (8c, Fig. 6) and 
fragment VI (9b, Fig. lb) were lighted and the resulting liydliun 
mix was used to transform competent /■;. -joli cells. 
•'^ From the ampici 11 in-resistant transfonnant cells containing pUR 
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1523, 1533, 1543, 1733, 1823, 1833, 1843 and 1933 were selected. 



lOd . Construction of expression plasmios giving rise to Lhe expression 
5 of preprochymosin (Fig. 17) 

Vector fragments C (8a, Fig. 3), fragment V (8d, Fig. 7) and frag- 
ment VI (9b, Fig. 15) were ligated and the resulting ligation mix 
was used to transform competent ii. aoH cells. 
From the ampicill in-resistant transformants cells containing pUR 
10 1524, 1534, 1544, 1734, 1824, 1834, 1844 and 1934 were selected 

(Fig. 17). 

IQe. Example demonstrating the use of site-directed mutagenesis to 
create allelic forms of the bovine preprochymosin or its 

15 maturation forms, departing from the chemical structure given 

in Fig. 1 and thereby converting residues 202 and 286 into 
aspartic acid residues, such in turn or in combination (Fig. 18-22). 
Plasmid pUR 1001 was cleaved with PstI and the resulting DNA frag- 
ments were ligated to the synthetic pentanucleotide 

20 (5 1 ) Ho dCTGCA OH Following ligation, the mixture was incubated 

with E. ooli DNA polymerase, large fragment in the presence of 
dGTP in order to make blunt-ends and phosphorylated with T4 kinase 
and ATP. The DNA was subsequently supplied with EcoRI-1 inker 
(5 1 ) dCATGAATTCATG (3 1 ) and then treated with EcoRI. A circa 880 bp 

25 long fragment extending from the EcoRI-site at position 549 to the 

carboxy terminal end of the chymosin encoding DNA was isolated. 
This fragment was subsequently cloned in the EcoRI-site of RFM13 
mp 2. Two clones were isolated, M13 1020 and M13 1021, which were 
different in the orientation of the EcoRI-insert with respect to 

30 each other (cf. Fig. 18). M13 1020 contained the coding strand 

(plus strand); M12 1021 contained the non-coding strand (minus 
strand), ss. Phage DNA of M13 1020 was converted into double- 
stranded DNA using E. coli CNA polymerase large fragment, dNTP's 
and (5 1 ) dTGGCCATCCCTGTCC (3 1 ) i 

35 

(675) 

or (5 l ) dAAACTCATCGTACTG (3*) ii 
(928) 

in turn as primers, using procedures described by S. Gil lam et at. 
(1979); Nucleic Acids Res. 5 6, 2973-2985. 



BA D ORIGINAL U 



0077109 

* 17 " Q 599 (R) 

The underlined bases represent mismatches in the primer/template 
hybrid. Following transformation of competent a. aoli JM101.7118 
cells (B. Gronenborn & J. Messing, Nature, 272, 375-377 (1978)), 
phages were screened for the required conversion into the 
chymosin encoding sequence by plaque hybridization with the 32 p 
labelled pentadecanucleotides i, ii, as probes and DNA-sequence 
analysis. 

Two phage isolates, M13.1022 and 1023 contained the DNA-sequences: 

asp arg asp gly 
(5') G GAC AGG GAT GGC CA (3') M13.1022 

675 

glN tyr asp glu phe 
(5') CAG TAG GAT GAG TTT (3') M13.1023 

928 

RF M13.1022 and RF M13.1023 were cleaved with EcoRI, dephospho- 
rylated and then cleaved with Pstl. From each preparation an 888 bp 
long fragment was isolated by agarose gel electrophoresis and 
electroelution. Apart from the required mutations, this fragment 
corresponds with fragment B (>La, Fig. 3). 

Using procedures identical with those described in 8a-8c, expression 
plasrnids were constructed which gave rise to the synthesi s of speci- 
fically altered pseudochymosin, chymosin, prochymosin and prepro- 
chymosin, respectively. 

• Culturing of A'. aoU cells containin g r ecombinant plasrnids de- 
scribed under 10 and detection and isolation of preprochymosin , 
prochymosin, pseudochymosin or chymosin 
E. aoli cells containing one of- the plasrnids 

PJR1521 (ATCC 39120), 1531, 1541, 1731, 1821, 1831, 1841, 1932 
PUR 1522, 1532, 1542, 1732, 1822, 1832 (ATCC 39197), 1842, 1932 
p(JR 1523, 1533 (ATCC 39121), 1543, 1733, 1823, 1833, 1843, 1933 
PUR1524, 1534, 1544, 1734 (ATCC 39198), 1824, 1834, 1844, 1944 
with or without the AATT-sequence in the linker oetween the reguion 
and the preprochymosin gen^: or its maturation forms were cultured. 
under optimal conditions for their growth. These culturing condi- 
tions vary with the type of plasmid present in the cells, oui a 
suitable antibiotic (ampicillinj was always present to maintain 
selection pressure. * BAD ORiGSftA* 
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Under these conditions the cells containing either plasmids 
pJR1521, 1531, 1541, 1731, 1821, 1831, 1841, 1931 or 
pJR 1522, 1532, 1542, 1732, 1822, 1832, 1842, 1932 or 
pJR1523, 1533, 1543, 1733, 1823, 1833, 1843, 1933 or 
pJR 1524, 1534, 1544, 1734, 1824, 1834, 1844, 1944 
produced considerable amounts of pseudochymosin , chymosin, 
prochymosin or preprochymosin, respectively. These amounts varied 
from 10 3 -10' molecules/cell, 

E. coli cells which contained preprochymosin or modified prepro- 
chymosin encoding plasmids contained (modified) preprochymosin 
in the cytoplasm and prochymosin in their periplasmic space. 

The bacterially produced preprochymosin, prochymosin and pseudo- 
chymosin could be converted into chymosin using the procedures 
described by \/. Barkholt Pedersen et at. (Eur. J. Biochem., 94, 
573-580 (1979)). The chymosins which were thus obtained and bac- 
terial ly-produced chymosin were shown to be fully biologically 
active in proteolysis. 

The presence of the proteins was further demonstrated by SDS- 
polyacryl amide gel electrophoresis with or without immuno- 
precipitation, and by immunological ELISA and RIA tests. The 
antisera for this test were generated by injecting bovine calf 
chymosin supplemented with Freund adjuvant into sheep as well 
as rabbits. 

The above description was focussed on the synthesis of chymosin in 
E. ooli cells; it is of course possible and desirable to use for 
that purpose non- toxic, edible micro-organisms, such as strepto- 
cocci or micro-organisms of Bacillus or yeast origin. 
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Abbreviations 

DNA deoxyribonucleic acid 

cDNA complementary DNA 

dsDNA double-stranded DNA 

RNA ribonucleic acid 

mRNA messenger RNA 

A adenine 

G guanine 

C cytosine 

T thymine 

RBS ribosome-binding site 

N any nucleotide 

bp pase pair 

M13 bacteriophage M13 

RF replicative form 

p plasmid 

32 P phosphorus 32 

n number 

p/o promotor/operator 

trp tryptophan 

lac lactose 

EcoRI restriction endonuclease derived from 

Hindi I I restriction endonuclease derived from 

Hinfl restriction endonuclease derived from 

Haelll restriction endonuclease derived from 

Bglll restriction endonuclease derived from 

HphI restriction endonuclease derived from 

PstI restriction endonuclease derived from 

Mnll restriction endonuclease derived from 

TaqI restriction endonuclease derived from 

BamHI restriction endonuclease derived from 

E. coli Escherichia coli 

ts thermosensitive 

Ddel restriction endonuclease derived from 

Hpall restriction endonuclease derived from 

ELISA enzyme linked immuno sorbent assay 

RIA radioimmune assay 

SDS sodium dodecyl sulphate 
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Escherichia coli RYI 
Haemophilus Influenzae Rd III 
Haemophilus Influenzae 
Haemophilus aegypticus 
Bacillus globii 
Haemophilus parahaemolyticus 
Providencia stuartii 
Moraxe I la nonliquef aciens 
Thermophilus aquaticus 
Bacillus amy loliquef aciens H 



Desulfovibrio desulfuricans 
Haemophilus parainfluenzae 
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Abbreviations (continued) 





met 


rne union i ne 




1 eu 


1 oi ir* i np 


b 


i le 


iso i euc i ne 




_ T _ 

ala 


al anine 




asp 


sspartic acid 




asN 


asparagi ne 




glu 


g I uuarm c aci u 




y i ti 


y I u Lain i lie 




Va 1 


Va 1 I He 




unr 


Uf 1 i Cull l I tc 






nhpnvl al ani np 

piiciijr io t ail I i * c 




tyr 


tyros i ne 




cys 


cys xe i ne 




arg 


a vni m no 

dry i n i ne 




C Q v* 

ser 


ben ne 




nib 


II 1 b L 1 U l I IC 




pro 


pro! ine 


20 


giy 


glycine 




lys 


lysine 




Ap 


ampicillin resistance 




Tc 


tetracycline resistance 
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Legends to the Figures : 

Fig. 1. Amino acid sequence of one of the allelic forms (B) of bovine 
preprochymosin and DNA sequence corresponding to the bovine 
preprochymosin B mRNA. 

The S numbering denotes numbering of the signal sequence amino 
acid residues. 

Fig. 2. Schematic representation of several examples of allelic forms 
of bovine prenrochymosin. The upper drawing corresponds with 
Fig. 1. The numbering in parentheses denotes the amino acid 
residues in the preprochymosin molecule. Other numbers refer 
to the nucleotide sequence of the mRNA. 

Fig. 3. Construction route of pUR 1520, 1530, 1540, 1730, 1820, 1840 
and 1930, described in 8a. 

Fig. 4. Construction route of dsDNA encoding amino terminal end of 
pseudochymosin plus a transcriptional initiation triplet, 
described in 8a. 

Fig. 5. Construction route of dsDNA encoding the amino terminal end 
of chymosin plus a transcriptional initiation triplet, 
described in 8b. 

Fig. 6. Construction route of dsDNA encoding the amino terminal end 
of prochymosin plus a transcriptional initiation triplet, 
described in 8c. 

Fig. 7. Construction route of dsDNA encoding the amino terminal end 

of preprochymosin, described in 8d. 
Fig. 8. Construction route of pUR 201, described in 9a. 
Fig. 9. Construction route of pUR 301, described in 9b. 
Fig. 10. Construction route of pUR 401, described in 9c. 
Fig. 11, Construction route of pUR 303, described in 9d. 
Fig. 12. Construction route of pUR 10, described in 9e. 
Fig. 13. Construction route of pUR 210, 310, 311 and 410, described 

in 9f. 

Fig. 14. Construction route of pUR 1521, 1531, 1541, 1731, 1821, 1831, 

1841 and 1931, described in 10a. 

Fig. 15 Construction route of pUR 1522, 1532, 1542, 1732, 1822, 1832, 

1842 and 1932, described in 10b. 

Fig. 16. Schematic representation of pUR 1523, 1533, 1543, 1733, 1823, 

1833, 1843 and 1933, described in 10c. 

Fig. 17. Schematic representation of pUR 1524, 1534, 1544, 1734, 1824, 

1834, 1844 and 1934, described in lOd. 
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Fig. 18. Construction route of M13.1020, M13.1021 and part of dsDNA 
encoding an allelic variation of bovine preprochymosin, 
described in lOe. 

5 Fig. 19. General representation of the plus strand DNA sequence corre- 
sponding to the structural gene encoding preprochymosin 

wherein : 

A is deoxyadenyl , 

G is deoxyguanyl , 

10 C is deoxycytosyl , 

T is thymidyl , 

J is A or G; 

K is T or C; 

L is A, T, C, or G; 

15 M is A, C or T; 

X is T or C if Y is A or G, and C if Y is C or T; 

Y is A, G, C or T if X is C, and A or G if X is T; 
W is C or A if Z is G or A, and C if Z is C or T; 
Z is A, G, C or T if W is C, and A or G if W is A; 

20 QR is TC if S is A, G, C or T, and AG if S is T or C; and 

S is A, G, C or T if QR is TC, and T or C if QR is AG. 
Fig. 20. General representation of the plus strand DNA sequence corre- 
sponding to the structural gene encoding prochymosin plus a 
transcriptional initiation ATG-triplet wherein : 

25 A is deoxyadenyl, 

G is deoxyguanyl , 
C is deoxycytosyl 9 
T is thymidyl, 
J is A or G; 

30 K is T or C; 

L is A, T, C, or G; 
M is A, C or T; 

X is T or C if Y is A or G, and C if Y is C or T; 

Y is A, G, C or T if X is C, and A or G if X is T; 
35 W is C or A if Z is G or A, and C if Z is C or T; 

Z is A, G, C or T if W is C, and A or G if W is A; 
QR is TC if S is A, G, C or T, and AG if S is T or C; and 
S is A, G, C or T if OR is TC, and T or C if QR is AG. 
Fig. 21. General representation of the plus strand DNA sequence corre- 
40 spondinq to the structural gene encoding pseudochymosin plus 
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a transcriptional initiation ATG-triplet wherein : 
A is deoxyadenyl , 
G is deoxyguanyl , 
C is deoxycytosyl , 
T is thymidyl , 
J is A or G; 
K is T or C; 
L is A, T, C, or G; 
M is A, C or T; 

X is T or C if Y is A or G, and C if Y is C or T; 

Y is A, G, C or T if X is C, and A or G if X is T; 
W is C or A if Z is G or A, and C if Z is C or T; 
Z is A, G, C or T if W is C, and A or G if W is A; 
QR is TC if S is A, G, C or T, and AG if S is T or t; and 
S is A, G, C or T if QR is TC, and T or C if OR is AG. 

Fig. 22. General representation of the plus strand DNA sequence corre- 
sponding to the structural gene encoding chymosin plus a 
transcriptional initiation ATG-triplet wherein : 
A is deoxyadenyl , 
G is deoxyguanyl , 
C is deoxycytosyl » 
T is thymidyl , 
J is A or G; 
K is T or C; 
L is A, T, C, or G; 
M is A, C or T; 

X is T or C if Y is A or G, and C if Y is C or T; 

Y is A, G, C or T if X is C, and A or G if X is T; 
W is C or A if Z is G or A, and C if Z is C or T; 
Z is A, G, C or T if W is C, and A or G if W is A; 
OR is TC if S is A, G, C or T, and AG if S is T oi C; and 
S is A, G, C or T if QR is TC, and T or C if OR is AG. 

General remarks on the figures 

Generally plasmid DNA is drawn as a single-lined circle, still this 
represents double-stranded DNA (bacteriophage M13. DNA is single-stranded; 
the replicative form RF, however, is double-stranded). 5 f -ends of cleaved 
DNA at restriction enzyme cleavage site are phosphorylated unless indi- 
cated otherwise; 3 [ -ends are always dephosphorylated. 
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'the numbers given in italics refer to the bovine preprochymosin DNA 
sequence given in Fig. 1; otherwise they refer to plasmid DNA sequences. 



5 SB dA/uT-stretch 
<\fl/W dG/dC~stretch 

vsssXsx regulon, the arrow indicates transcription direction 
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CLAIMS 

!. DNA sequences encoding various allelic and maturation forms of 
mammalian preprochymosin according to Fig. 1 and 2. 

2. DNA sequences encoding various allelic and maturation forms of 
mammalian preprochymosin according to Fig. 19, 20, 21 and 21. 

3. A recombinant plasmid comprising 

(i) a structural gene coding for the various allelic and 
maturation forms of mammalian preprochymosin according to Fig, 1 and 2 
or 19-22. 

(ii) specific DNA sequences regulating the expression of said 
gene in a microbial host, consisting of an inducible or constitutive 
regulon. 

4. A recombinant plasmid according to Claim 2, wherein the regulon 
consists of a double lac UV5 system according to Fig, 8. 

5. A recombinant plasmid according to Claim 3, comprising a regulon 
consisting of at least one modified tryptophan system according to 
Fig. 9, wherein the information coding for the trp attenuator protein 
is eliminated while its ribosome-binding site is maintained. 

6. A recombinant plasmid according to Claim 5, comprising two trp 
regulons linked in a head to tail fashion according to Fig. 11. 

7. A recombinant plasmid according to Claim 3, comprising a regulon 
consisting of at least one modified promotor/ribosome-binding site of 
gene VIII of bacteriophage M13, according to Fig. 10. 

8. A recombinant plasmid according to Claim 2, comprising a 
thermosensitive replication mutant derived from the clcacir, DT 13 
plasmid pVU 208 according to Fig. 12 and 13. 

9. A recombinant plasmid according to Claim 1, selected from 
pUR 1521 (AK.C 39120), 1531, 1541, 1731, 1821, 1831, 1841, 1931 

P lJR 1522 > 1532, 1542, 1732, 1822, 1832 ^ATCC 39197), 1842, 1932 

pUR 1523, 1533 (AfCC 39121), 1543, 1733, 1823, 1833, 1843, 1933 

pl/R 1524, 1534, 1534, 1544, 1734 (ATCC 39198), 1824, 1834, 1844, 1944 
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10. A bacterial culture comprising E-ccli cells comprising a placed 
according to Claim 9. 

11. Micro-organisms, particularly non-toxic 9 edible micro-organisms 
such as streptococci or lactobacilli, or micro-organisms of bacillus or 
yeast origin, in which are incorporated a DNA sequence according to Claim 1 
or Claim 2, and an appropriate regulon, suitable for the production of 
bovine calf preprochymosin or any one of the various allelic and maturation 
forms thereof. 
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3. A process for producing DMA sequences encoding various allelic 
and maturation forms of preprochymosin according to Fig. 1 and 2 or 
Fig. 19-22, characterized in that : 

(a) the message RNA (rnRNA) of preprochymosin is isolated and 
purified; 

(b) rnRNA is converted into double stranded DNA (dsDNA); 

(c) said dsDNA is incorporated in an appropriate plasmid; 

(d) said plasmid is transferred into bacterial cells and hybrid 
plasmid DNA molecules are selected to yield a plasmid containing a 
substantially complete copy of the preprochymosin rnRNA; 

(e) this DNA is further genetically engineered so as to code 
exactly for preprochymosin or its maturation forms and brought under 
transcriptional control of appropriate regulons. 

2. A process for producing a recombinant plasmid, characterized by 
combining 

(i) a structural gene coding for the various allelic and 
maturation forms of mammalian preprochymosin according to Fig. 1 and 2 
or 19-22; with 

(ii) specific DNA sequences regulating the expression of said gene 
in a microbial host, consisting of an inducible or constitutive regulon. 

3. A process according to Claim 2, characterized in that the regulon 
consists of a double lac UV5 system according to Fig. 8. 

4. A process according to Claim 2, characterized in that a regulon 
is used which consists of at least one modified tryptophan system 
according to Fig. 9, wherein the information coding for the trp 
attenuator protein is eliminated while its ribosome-binding site is 
maintained. 

5. A process according to Claim 4, characterized in that 2 trp 
regi Ions are used which are linked in a head to tail fashion according 
to Fig. 11. 
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6. A process according to Claim 2, characterized in zhot a iegulon 
is used which consists of at lna*t. one modifier? prwni-or/Hhnso«!9- 
binding site of oene VIII of bacteriophage Mi 3, according co Fig. 10, 

7. A process according to Claim 2, characterized in that a therrao- 
sensitive replication mutant is used v/hich is derived from the cloacin 
DM3 plasmid pVU HQS according to Fig. 12 and 13. 

8. A process according to Claim 2, characterized in that a plasmid 
is used which is selected from : 

pUR 1521 (ATCC 39120), 1531, 1541, 1731, 1821, 1831, 1841, 1931 

PUR 1522, 1532, 1542, 1732, 1822, 1832 (ATCC 39197), 1842, 1932 

pUR 1523, 1533 (ATCC 39121), 1543, 1733, 1823, 1833, 1843, 1933 

pUR 1524, 1534, 1534, 1544, 1734 (ATCC 39198), 1824, 1834, 1844, 1944 

9. A process for producing mammalian chymosin, characterized in that 

(a) a plasmid according to Claim 2 is incorporated in a micro- 
organism; 

(b) the transformed micro-organism is cultivated and 

(c) chymosin produced by said micro-organism is isolated. 

10- A process according to Claim 9, characterized in that the micro- 
organisms consist of E-coli cells. 

11. A process according to Claim 9, characterized in that non-toxic 
edible micro-organisms such as streptococci : or lactobacil li , or micro- 
organisms of bacillus or yeast origin are used. 
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* 1 % 1 0 

met cirg cyi leu vai val leu lc»u ala val phe au leu 

CAOCO ©CT ©OA CCC ACA TCC AAG ATG AGC TGT CTC GIG GTG C7A CTT GCT GTC TTC OCT CTr 
3 

•13 1 1 O 

• •r glN gly ALA OLU ILL THR ARG ILE PRO I £ U TVH LYSJ CI Y LYS Sf P LEU ARC LYS ALA 

TCC CAA GGC GCT GAG ATC ACC AGG ATC CCT CI G TAC AAA GGC AAG TCT CTG AGG AAG GCG 
63 

20 » 30 

LEU LYS OLU HIS CLY LEU LEU CLU ASP PHH LEU GLN LYS GLN GLN TYR GL v ILE SER SER 

CTO AAG CAG CAT GGG CTT CTG GAG OAC TTC CTG CAG AAA CAG C AG TAT GGC ATC AGC AGC 
123 

40 50 

LYS TYR SER OLY PHE OLY OLU VAL ALA SER VAL PRO LEU THR ASN TYR t EU ASP SER GLN 

AAO TAC TCC OOC TTC OCO GAG CTO ©CC AGC GTG CCC CTG ACC AAC TAC CTO GAT AGT CAG 
183 

AO 70 

TYR PHE OLY LYS ILE TYR LEU CLY THR PRO PRO CLN CLU PHE THR VAL LEU PHE ASP THR 

TAC TTT 000 AAO ATC TAC CTC 000 ACC CCC CCC CAG GAG TTC ACC OTC CTO TTT CAC ACT 
243 

SO 90 

OLY SCR SER ASP PHE TRP VAL PRO SER ILE TYR CYS LYS SER ASN ALA CYS LYS ASN HIS 

OOC TCC TCT OAC TTC TGG OTA CCC TCT ATC 1AC TOC AAG AGC AAT GCC TGC AAA AAC CAC 
303 

too no 

OLN AAO PHE ASP PRO ARO LYS SER SER THR PHE CLN ASN LEU CLY LYS PRO LEU SER ILE 

CAO COC TTC CAC CCG ACA AAG TCG TCC ACC TTC CAG AAC CTC CGC AAG CCC CTG TCT ATC 
363 

120 l30 

HIS TYR OLY THR GL Y SER MET CLN CLY ILE LEU CLY TYR ASP THR VAL THR VAL SER ASN 

423 CAC CG ° ACC CTC ACT GTC TCC AAC 

140 ISO 

ILE VAL ASP ILE GLN CLN THR VAL GL Y LEU SER THR CLN OLU PRO CLY ASP VAL PHE THR 

ATT OTO CAC ATC CAG CAG ACA GTA GGC CTG AGC ACC CAG GAG CCC GGG G*C GTC TTC ACC 
483 

160 170 

TYR ALA CLU PHE ASP GLY ILE LEU CLY ME^ ALA TYR PRO SER LEU ALA iXR GL U TYR SLR 

TAT CCC OAA TTC GAC CCG ATC CTG CCG ATG GCC TAC CCC TCG CTC CCC TCA CAG TAC TCO 
543 

ISO | 90 

ILE PRO VAL PHE ASP ASN MET MET ASN ARG HIS l M« VAL ALA GLN AO ICU PK Sf R VAL 

ATA CCC GTG TTT GAC AAC ATG ATG AAC AGC CAC TG GTG GCC CAA GAC CTO 7 TC TIG GTT 
603 

200 ?l0 

TYR MET ASP ARG ASN GLY CLN GLU SLR MET I EU THR LEU CLY ALA HE ASP PRO 'oLR TYR 

TAC ATC CAC AGG AAT GGC CAG GAG ACC A"! G CTC ACG C!G GG'J GCC AIC CAC CCG TCC TAC 
663 



Fig. I 



V- 



Q 599 

007710! 



220 ?30 

TVR THR OLY 6ER LEU HIS TRP VAL PRO VAL THR VAL GLN GLN TYR TRP Ci N PHET THR VAL 

TAC AC A GOG TCC CTO CAT TOO CTO CCC GTC AC A GTG CAG CAC 1AC TOG C AG T7C ACT GTC 
723 

240 250 

ASP SER VAL THR ILE SER GLY VAL VAL VAL ALA CYS GLU. GLY GLY CYS GL.N ALA ILE LEU 

OAC AGT OTC ACC ATC AGC GOT GTC CTT CTG CCC TGT GAG CG T GGC TGT CAG CCC ATC CTC 
783 

260 270 

ASP THR OLY THR SER LYS LEU VAL GLY PRO S£ R SER ASP ILE LEU ASN ILE GLN GLN ALA 

OAC ACO GOC ACC TCC AAG CTG OTC GGG CCC AGC AGO GAC ATC CTC AAC ATC CAG CAC CCC 
B43 

280 290 

ILE CLY ALA THR OLN ASN CLN TYR OLY OLU PHE ASP ILE ASP CYS ASP ASN LEU SER TYR 

ATT OOA CCC ACA CAO AAC CAO TAC CGT CAC TTT CAC ATC GAC TCC CAC AAC CTG ACC TAC 
903 

300 310 

MET PRO THR VAL VAL PHE GLU ILE ASN GLY LYS MET TYR PRO LEU THR PRO SER ALA TYR 

ATO CCC ACT CTO OTC TTT CAG ATC AAT GGC AAA ATG TAC CCA CTC ACC CCC TCC GCC TAT 

320 330 

THR SER OLN ASP OLN CLY PHE CYS THR SER GLY PHE GLN SER GLU ASN HIS SER GLN LYS 

ACC ACC CAO CAC CAO CCC TTC TOT ACC AGT CCC TTC CAC AGT GAA AAT CAT TCC CAC AAA 
1023 

340 350 

TRP ILE LEU CLY ASP VAL PHE ILE ARC CLU TYR TYR SER VAL PHE ASP ARG ALA ASN ASN 

TGC ATC CTO CCC CAT CTT TTC ATC CGA GAC TAT TAC AGC GTC TTT GAC ACQ GCC AAC AAC 
1083 

360 365 

LEU VAL CLY LEU ALA LYS ALA ILE •** 

CTC CTO 000 CTC OCC AAA GCC ATC TCA TCA CAT CGC TCA CCA AGA ACC TCA CTC TCC CCA 
1143 

CAC ACC TGC ACA CAC ACA TGC ACA CAT GTA CAT CAC CAC ATG TGC ACA CAC ACA GAT CAG 
1203 

GTT TCC ACA CAC ATC ATT CTC AAT AAA CGT TGT CTT TCT GCA AAA AAA A 

12*3 1303 
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ATCWGZTGKXTYGTLGTLX TYX TYCCLGTLTTKGCLX TYQRSCAJCGLGCLCAJATMACL 
WGZATMCCLXTYTAKAAJGGLAAJQRSXTYWGZ AAJCCLX TYAAJGA JCAK.GGT.X TYX T Y 
GAJGAKTTKXTYCAJAAJCAJCAJTAKGGLATMQRSQRSAAJTAKQRSGGLTTKGGLGAJ 
UJ-LG'CLyKSMiiLcCLX i i A CLaAKT A i\ a I Y G A K QR3CAJ7A1C TTK G G L A A J A TUT A K X T Y 
GGLACLCCLCCLCAJGA JTTKACLGTLX T YT TKGAKA CLGG LQRSQRSGAKTTKTGGGTL 
CCLQRSATMTAKTGKAAJqRSAAKGCLTGKAAJ AAKCAKCAJWCZTTKGAKCCLWGKAAJ 
QRSQRSACLTTKCAJAAKX TYCGLAAJ CCLX TYQRSATMCAKTAKGG1 A CLCGLQRS ATG 
CAJGGLATMX TYGGLTAKGAKA CLGTLACLGTLQRSAAKATMG TLGAKATMCAJCA JACL 
GTLGGLX TYQRSACLCA JGA J C CLGG LGAKGTLTTKA CL TAKGCLG A J TTKGAKGGLATM 
X TYGGLATGGCLTAKC CLQRSX TYGCLQRSGAJ TAKQRS ATMCCLGTLTTKCAK AAKATG 
ATGAAKQRS CAKX TYGTLGCLCA J GAKX TYTTKQRS G TLTAKATGGAKWGZ AAK GG LC A J 
C AJQRSATGX TYACLX T YGGLG CLATMG AKCCLQRS TAKTAKA CLGGLQRSX T YCAKTGC 
G TLCCLGTLACLGTLCAJCAJTAKTGCCAJTTKA CLGTLGAKQRSG TLACLATMQ R SGCT, 
GTLCTLGTLGCLTGKGAJGGLGGLTGKCAJGCLATMXTYCAKACLGCLACLQRSAAJXTY 
g tlgglcclqrsqrsga'katmx TYAAKATMCAJ CAJGCLATMGGLGCLACLCAJA AKCA J 
TAKGGLGAJTTKGAKATMG AKTGKGAKAAKX TYQRS TAKATG CCLACLG TLGTLTTKGA J 
ATMAAKGGLAA JATG TAKC CLX T YA CLCCLQRSG CLTAKA CLQRS CAJ GAK CA J GG LTTK 
TGKACLQRSGGLTTKCAJQRSG A J AAKCAKQRS CAJ A AJ TGG ATMX T YGGLGAK G TLTTK. 
ATMWGZGAJ TAXI AKQRSG TLTTK GAK WGZGCLAAKAAKX TYG TLGGKX TYGCLAAJGCL 
ATMTGA 
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ATGGCLGAJATMACLWGZ ATMCCLX TYTAKAAJ CGLAA JQRSX T YV/GZ AA JGCLX TYAAJ 
G AJ CAKGGLX TYX T YGA J GAKTTKX T YCA J A A J C AJ C A J TAKCCLATMQKSQRS AA JT AK 
QRSGGLTTKGGLGAJGTLGCLQRSGTLCCLX TYA CLAAKTAKX TYGAKQRS CAJTAICTTK 
u^LaAJA TMT AKX TYCCLACLCCL C CL C * J C A J T T K A r L n T T v t Y T ty k A K A CT,G G LOR S 
QRSGAKTTKTGGGTLCCLQRS ATMTAKTGKAAJ QRS AAKGCLTG KAAJ AAKCAKCAJ WGZ 
T TKGAKCCLWGKAAJ QRSQRS A CLTTKCAJ AAKX TYGGLAAJ CCLX T YQRS ATM CAKTAK 
CGLACLGGLQRSATGCAJGCLATMXTYGGLTAKCAKACLGTLACLGTLQRSAAKATMGTL 
G AKATMCA JCAJACLGTLGGLX T YQRS ACL CAJGA JC CLGGLCAKCTLTTKACLTAK.CCL 
G AJ TTKGAKGGLATMX T YGGLATGGCLTAKC CLQRSX TYCCLQRSG AJTAKQRS ATMCCL 
GTLTTKGAKAAKATGATGAAKQRSCAKX TYGTLGCLCAJGAKXTYTTKQRSGTLTAKATG 
GAKWGZAAKGGLCAJGAJQRSATGX TYACLX T YGGLGCLATMGAKC CLORSTAKTAKACL 
G GLQRSX T YCAKT GGG TLC CLG TLA CLGTLCA J CA JT AKTGG C A J TTKA CLG TLGAKQRS 
G TLACLATMQRSGGLGTLGTLGTLGCLTGKGA J GG LG GLTGKCA J GCLATMX TYGAK A CL 
GGLACLQRSAAJX T YGTLGGLC CLQR SQ RSGAKATMX T YA AKATM CA.J CA JGCLATMGGL 
G CLACLCAJAAKCA JTAKGGLGAJ TTKG AKATMG AKTGKGAKA AKX T YQRSTAKATG CCL 
ACLGTLGTLTTKGA JATMAAKGGLAAJ ATGTAKC CLX TYA CLCCLQRSG CLTAKA CLQRS 
CAJ GAKCAJGGLTTKTGKACLQRSGGLTTKCAJQRSGA JAAKCAKQRS CAJAAJTGGATM 
X TYGGLGAKGTLTTKATMWGZG A JTAKTAKQRSG TLTTKCAKWCZG CLAAKA AKX TYGTL 
GGKX TYGCLAAJGCLATMTGA 
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ATGX TYCAJAAJCAJ CAJTAK r :GLAT'MQRSQRSAAJTARQRSGGLTTKGGLGAJGTLGGL 
QRSGTLCCLX TYACLAARTARX TYGAKQRS CA JTARTTRCGLAAJATMTAKXTYGGLAGL 
CCLCCLCAJGAJTTKACLGTLX TYTTRGARACLGGLQRSQRSGAKTTRTCGGTLGCLQRS 
ATMTAKTGKAAJ QRSAAKOCLTGK AAJA AKCAKCAJWGZTTKGAKCGLWGKAA.JQRSORS 
A CLTTRCAJA ARX TYGGLAA JCCLX T YQRS ATMCAKTAKCG LA CLGGLQRS ATGCAJ GGL 
ATMX TYGGLTAKGAKA CLGTLA CLGTLQRSAAKA TMG TLGAKA TM GAJ CA.J A CLGTLGGL 
X TYQRSACLCAJ GAJ CCLGGLGARGTLTTRA CLTAK GCLCA J T f KGARGGLATMX TYGGL 
ATCGCLTAKCCLORSX TYGCLQRSGAJTARQU.SATMCCLC'i LT TKGAKAAKATCATGA AK 
Q RS CAKX TYGTLGCLCAJ GAKX TYTTRQRSG TLTAKATGCAKWCZ AARGG LCAJ GA . OPS 
ATGX T YA CLX T YG G LG CL ATMG AK CCLQRS TARTAR A GLCGLQRSX TYCAKTGGG TLCCL 
GTLACLGTLCAJ CAJTARTGGCAJTTKACLGTLGAKQRSC TLACLATMORSGGLGTLGTL 
GTLGCLTGRGAJ GG LCGLTCRCAJ GCLA TMX TYGARA CLOG LA Ci.QRS AAJX TYGTLGGL 
CCLQRSQRSGAKATMX TYAAKATMCAJ CAJGC.LATMGG LGCLA CLGAJ AAK CAJ TARGG I, 

GAJTTKGAKATMGAKTGKCAKAARXTYQRSTAKATGCCLACLG ILGTLTTRGAJ ATMAAK 
GGLAAJATGTAKCCLX TYACLCCLQRSG CLTARA CLQRS CAJ GAKCAJGGLTTKTGRACL 
QRSGGLTTKCAJQRSGAJAARCARQRSCAJ AAJTGCATMX TYGGLGAKGTLTTK ATMWGZ 
G AJTAKTARQRSGTLTTKCARWGZG CLAARAAKX TYGTLCGKX T YGCLAA J GCLATMTGA 
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ATGGGLCAJGTLGCLQRSGTLCCLX TYACLAAKTAKX TYGAKQRS CAJTAKTTKGGLAAJ 
ATMTAKX T YGGLACLCCLCCL CAJCA JTTKA CLCT1<X T YTTKGAK ACLGGLQRSQRSG AK 
TTKTGGGTLCCLQRSATMTAKTGKAAJ ^KSAAKUCL ibKAAJ aaklaKCa.j wu/.i itCunis. 
CCLWGKAAJQP.SQRS ACLTTKCA.JAAKX TYGGLAAJC CLX TYQRS ATMCAKTAKGGLACL 
GGLQRSATGCAJGGLATMX T YGCLTAKGAKACLGTLACIX'TlA'RSAAKATMCTLGAKATM 
CAJ CAJACLGTLGGLX TYORSACLCAJGAJCCLGGLCAKCTLTTKACLTAKGCLCAJTTK 
G AKGCLATMX TYGGLAl'GGCLTAKCCLQRSX T YGCLQR SG A J T AK QRSATM CCLCTLTTK 
GAKAAKATGATGAAKQRSCAKX TYGTLGCLCA JGAKXTYTTK QRSGTL TAKATGGAKWGZ 
AAKGGLCAJGA JQRS ATGX T YACLX X YGG LG CLATMC AK C CLQRSTAKTAKA CLGC LQRS 
X TYCAKTGGGTLCCLGTLACLGTLGA J CA JTAKTGGC A J TTKA CLGTLGAKQRSG TLA CL 
ATMQRSGGLGTLGTLGTLGCLTGKCAJGGLGGLTGKCA JGC1.ATMX T YGAK A CLGG LA CL 
Q RSAA JX TYGTLCGLCCLQRSQRSGAKATMX TYAAKATM CAJ CAJ C. CLATMC GLGCLACL 
CA JAAKCA JTAKGGLGA J TTKGAKATMGAKTGKGAKA AKX TYQRS TAKA TGCCLACLGTL 
GTLTTKGAJATMAAKGGLAAJATGTAKCCLX TYACLCCLQRSGCLTAKACLQRSCA JGAK 
CAJ GGLTTKTGKACLQRSGGLTTKCAJQRSGAJ AAKCAKQR S CA JAA JTGGATMX TYGGL 
GAKGTLTTKATMWGZGAJTAKTAKQRSGTLTTKGAKWGZG Cl.AAKAAKX TYGTLGGKXTY 
G CLAAJGCLATMTGA 
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